Component-based Generalized Linear Regression using a PLS-extended variant of the Fisher scoring algorithm
نویسندگان
چکیده
In the current estimation of a GLM model, the correlation structure of regressors is not used as the basis on which to lean strong predictive dimensions. Looking for linear combinations of regressors that merely maximize the likelihood of the GLM has two major consequences: 1) collinearity of regressors is a factor of estimation instability, and 2) as predictive dimensions may lean on noise, both predictive and explanatory powers of the model are jeopardized. For a single dependent variable, attempts have been made to adapt PLS Regression, which solves this problem in the classical Linear Model, to GLM estimation. In this paper, we first discuss the methods thus developed, and then propose an algorithm that combines PLS regression with GLM estimation in the multivariate context, under a conditional independence assumption. Our algorithm is tested on simulated data. ∗Corresponding author Tel : +33 (0)467 144 164, Fax : +33 (0)467 143 558 Email addresses: [email protected] (X. Bry), [email protected] (C. Trottier), [email protected] (T. Verron) Preprint submitted to Elsevier September 30, 2011 ha l-0 06 64 77 8, v er si on 1 31 J an 2 01 2
منابع مشابه
PLS generalized linear regression Application to the analysis of life time data
Problems encountered in multiple regression due to multicolinearity or missing data can be overcome by using PLS regression. Several versions of the PLS regression algorithm exist. In this paper, we present a new version of this algorithm which can be extended to generalized linear regression models such as ordinal or multinomial logistic regression, generalized linear models, and Cox regressio...
متن کاملExponential Dispersion Models and the Gauss-Newton Algorithm
It is well known that the Fisher scoring iteration for generalized linear models has the same form as the Gauss-Newton algorithm for normal regression. This note shows that exponential dispersion models are the most general families to preserve this form for the scoring iteration. Therefore exponential dispersion models are the most general extension of generalized linear models for which the a...
متن کاملA Methodological Comparison between PLS Path Modeling and Generalized Structured Component Analysis
PLS Path Modeling and Generalized Structured Component Analysis are two component-based approaches to Structural Equation Models. Both methods aim at estimating the causal relationships linking two or more latent variables by means of a set of observed indicators. Moreover, both methods define the latent variable as a linear combination of its own observed variables, i.e. as a component. The ma...
متن کاملPredictive model building for microarray data using generalized partial least squares model
Microarray technology enables simultaneously monitoring the expression of hundreds of thousands of genes in an entire genome. This results in the microarray data with the number of genes p far exceeding the number of samples n. Traditional statistical methods do not work well when n p. Dimension reduction methods are often required before applying standard statistical methods, popular among the...
متن کاملA Uniied Approach to Pca, Pls, Mlr and Cca
This paper presents a novel algorithm for analysis of stochastic processes. The algorithm can be used to nd the required solutions in the cases of principal component analysis (PCA), partial least squares (PLS), canonical correlation analysis (CCA) or multiple linear regression (MLR). The algorithm is iterative and sequential in its structure and uses on-line stochastic approximation to reach a...
متن کامل